BlogPulse: Automated Trend Discovery for Weblogs
Authors
Abstract
Over the past few years, weblogs have emerged as a new communication and publication medium on the Internet. In this paper, we describe the application of data mining, information extraction and NLP algorithms for discovering trends across our subset of approximately 100,000 weblogs. We publish daily lists of key persons, key phrases, and key paragraphs to a public web site, BlogPulse.com. In addition, we maintain a searchable index of weblog entries. On top of the search index, we have implemented trend search, which graphs the normalized trend line over time for a search query and provides a way to estimate the relative buzz of word of mouth for given topics over time.
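The abstract does not say how the trend line is normalized. As a minimal sketch, assuming the normalization is the share of each day's indexed posts that match the query, trend search could look like the following. The function name `normalized_trend`, the `(date, text)` post representation, and the sample posts are all hypothetical and used only for illustration.

```python
from collections import Counter
from datetime import date


def normalized_trend(posts, query):
    """Return {day: fraction of that day's posts containing the query}.

    Assumption: "normalized" means matching posts divided by all posts
    indexed for the same day, so days with more posts are not overweighted.
    """
    totals = Counter()  # all posts per day
    hits = Counter()    # posts per day that contain the query
    needle = query.lower()
    for day, text in posts:
        totals[day] += 1
        if needle in text.lower():
            hits[day] += 1
    # Every day in totals has at least one post, so the division is safe.
    return {day: hits[day] / totals[day] for day in sorted(totals)}


# Made-up sample posts, not data from BlogPulse.
posts = [
    (date(2004, 5, 1), "Thoughts on the election debate"),
    (date(2004, 5, 1), "My cat learned a new trick"),
    (date(2004, 5, 2), "Election coverage is everywhere"),
    (date(2004, 5, 2), "More election talk today"),
]
trend = normalized_trend(posts, "election")
```

Dividing by the daily total rather than plotting raw counts keeps a day with unusually heavy posting from looking like a spike in interest, which is what makes the series usable as a relative measure of buzz across time.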
Similar resources
Survey on Perception of People Regarding Utilization of Computer Science & Information Technology in Manipulation of Big Data, Disease Detection & Drug Discovery
This research explores the manipulation of biomedical big data and disease detection using automated computing mechanisms. Because an efficient and cost-effective way to discover diseases and drugs is important for society, computer-aided automated systems are a must. This paper aims to understand how important people consider computer-aided automated systems to be. The analysis result from collected da...
Indexing Weblogs One Post at a Time
In order to perform analysis over weblogs, we must first identify the appropriate unit of a weblog that corresponds to a document. We argue in the paper that, for weblogs, the correct unit is the weblog post. A weblog post is a structured document with the following fields: date, timestamp, title, content, permalink and author. We present our approach for segmenting weblogs into posts, which br...
Envisioning With Weblogs
In this position paper we present a vision of how the stories that people tell in Internet weblogs can be used directly for automated commonsense reasoning, specifically to support the core envisionment functions of event prediction, explanation, and imagination.
A Content Analysis of Posts and Comments in Persian Individual and Group Library and Information Science Weblogs
The present study employed a content analysis method to analyze the posts and comments in 85 individual and 31 collective weblogs published in Farsi on the subject of library and information science. The results showed that average monthly postings in collective weblogs exceed those in individual weblogs, while for comments the reverse is true. The highest numbers of postings i...
Weblogs: Technology for Instruction and Learning
As weblogs, still in their nascent state, become one of the most popular online activities after web surfing, email, and instant messaging, they are considered more of a trend in net broadcasting than just a fad. The emergence of bloggers and their behavioral models has opened up new research opportunities from many perspectives. The aims of this article are twofold: 1) demonstrate how weblogs...